SEQUENCE LISTING



<110> Chang, Sandra P
      Christopher, David A
      Vine, Benjamin
      Su, Wei-Wen
      Bugos, Robert

<120> PLASMODIUM FALCIPARUM MEROZOITE SURFACE PROTEIN-1 MALARIA PRODUCED
      IN TRANSGENIC PLANTS

<130> A-71339/RFT/TAL/NBC 

<140> US 10/098,514 
<141> 2002-03-11 

<150> US 09/500,376 
<151> 2000-02-08 

<150> US 60/274,599 
<151> 2001-03-09 

<160> 19 

<170> PatentIn version 3.1

<210> 1 

<211> 1149 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic 
<220> 

<221> sig_peptide 

<222> (1)..(24) 
<223> 



<400> 1 
gccgaattcg acaacatcct cagtgacaac atcctcagtg gcttcgagaa cgagtacgac  60
gtaatctacc taaagcccct tgccggtgtc taccgttcat tgaagaaaca gatagaaaag  120
aatattttca cgttcaacct caacctaaat gacatcctca actcgcgcct caagaagcga  180
aaatacttcc tcgacgtgtt ggaatccgac cttatgcaat tcaagcacat tagctctaac  240
gagtacatca tagaggacag cttcaagctc ttgaattcag aacagaagaa caccctccta  300
aagtcctaca aatacattaa ggagtctgtt gagaacgaca tcaagttcgc ccaggaagga  360
attagctact atgagaaagt cctggctaaa tacaaggacg acttggaaag cattaagaag  420
gtaatcaaag aagagaagga aaagtttccg agctctccac ccacaactcc cccatcgcct  480
gcaaagaccg acgagcagaa aaaagaaagt aagttccttc cattcctcac caacatcgaa  540
actctatata acaacctggt gaacaagatt gatgactact taatcaactt gaaggcgaaa  600
attaatgact gtaacgtcga aaaggatgaa gcccacgtta agatcaccaa gctttccgat  660
ctcaaagcca tcgacgataa gattgacctg tttaagaacc acaacgattt cgacgcaatc  720
aaaaagttga tcaacgacga tactaagaaa gacatgcttg gaaaactgct gtcgacaggc  780
ttggtccaaa acttcccgaa caccattata agcaagctga tcgaaggaaa gtttcaggat  840
atgctgaaca tctctcagca tcaatgcgtg aagaagcaat gtcccgagaa ttcaggttgc  900
ttccgccact tagacgaaag ggaggaatgt aaatgcctgc tgaattataa acaggaagga  960
gacaagtgcg tagagaatcc taacccaacc tgtaacgaaa ataacggtgg ctgcgatgct  1020
gacgctaagt gtaccgagga ggacagcggt tccaatggca agaaaataac ttgcgaatgc  1080
acgaagcccg atagttaccc tctcttcgac ggtatcttct gctcccatga tgagctttaa  1140
gagctcacc  1149













<210> 2 

<211> 383 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> synthetic 
<220> 

<221> MISC_FEATURE

<222> (380)..(380)

<223> "Xaa" at position 380 represents a stop codon 



<400> 2 

Ala Glu Phe Asp Asn Ile Leu Ser Asp Asn Ile Leu Ser Gly Phe Glu
1 5 10 15

Asn Glu Tyr Asp Val Ile Tyr Leu Lys Pro Leu Ala Gly Val Tyr Arg
20 25 30

Ser Leu Lys Lys Gln Ile Glu Lys Asn Ile Phe Thr Phe Asn Leu Asn
35 40 45

Leu Asn Asp Ile Leu Asn Ser Arg Leu Lys Lys Arg Lys Tyr Phe Leu
50 55 60

Asp Val Leu Glu Ser Asp Leu Met Gln Phe Lys His Ile Ser Ser Asn
65 70 75 80

Glu Tyr Ile Ile Glu Asp Ser Phe Lys Leu Leu Asn Ser Glu Gln Lys
85 90 95

Asn Thr Leu Leu Lys Ser Tyr Lys Tyr Ile Lys Glu Ser Val Glu Asn
100 105 110

Asp Ile Lys Phe Ala Gln Glu Gly Ile Ser Tyr Tyr Glu Lys Val Leu
115 120 125

Ala Lys Tyr Lys Asp Asp Leu Glu Ser Ile Lys Lys Val Ile Lys Glu
130 135 140

Glu Lys Glu Lys Phe Pro Ser Ser Pro Pro Thr Thr Pro Pro Ser Pro
145 150 155 160

Ala Lys Thr Asp Glu Gln Lys Lys Glu Ser Lys Phe Leu Pro Phe Leu
165 170 175

Thr Asn Ile Glu Thr Leu Tyr Asn Asn Leu Val Asn Lys Ile Asp Asp
180 185 190

Tyr Leu Ile Asn Leu Lys Ala Lys Ile Asn Asp Cys Asn Val Glu Lys
195 200 205

Asp Glu Ala His Val Lys Ile Thr Lys Leu Ser Asp Leu Lys Ala Ile
210 215 220

Asp Asp Lys Ile Asp Leu Phe Lys Asn His Asn Asp Phe Asp Ala Ile
225 230 235 240

Lys Lys Leu Ile Asn Asp Asp Thr Lys Lys Asp Met Leu Gly Lys Leu
245 250 255

Leu Ser Thr Gly Leu Val Gln Asn Phe Pro Asn Thr Ile Ile Ser Lys
260 265 270

Leu Ile Glu Gly Lys Phe Gln Asp Met Leu Asn Ile Ser Gln His Gln
275 280 285

Cys Val Lys Lys Gln Cys Pro Glu Asn Ser Gly Cys Phe Arg His Leu
290 295 300

Asp Glu Arg Glu Glu Cys Lys Cys Leu Leu Asn Tyr Lys Gln Glu Gly
305 310 315 320

Asp Lys Cys Val Glu Asn Pro Asn Pro Thr Cys Asn Glu Asn Asn Gly
325 330 335

Gly Cys Asp Ala Asp Ala Lys Cys Thr Glu Glu Asp Ser Gly Ser Asn
340 345 350

Gly Lys Lys Ile Thr Cys Glu Cys Thr Lys Pro Asp Ser Tyr Pro Leu
355 360 365

Phe Asp Gly Ile Phe Cys Ser His Asp Glu Leu Xaa Glu Leu Thr
370 375 380
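
The stop-codon annotation for SEQ ID NO: 2 can be cross-checked against SEQ ID NO: 1 with simple arithmetic: residue n corresponds to nucleotides 3(n-1)+1 through 3n. The sketch below is an illustration only and is not part of the filed listing. It assumes the reading frame starts at nucleotide 1 of SEQ ID NO: 1, and the helper name `codon_at` is hypothetical.

```python
# Last 69 nucleotides of SEQ ID NO: 1 (positions 1081-1149), copied from the
# listing with the grouping spaces removed.
TAIL_START = 1081
TAIL = (
    "acgaagcccgatagttaccctctcttcgacggtatcttctgctcccatga"
    "tgagctttaa"
    "gagctcacc"
)

STOP_CODONS = {"taa", "tag", "tga"}


def codon_at(residue: int) -> str:
    """Return the codon for a 1-based residue number.

    Assumption: translation starts at nucleotide 1 of SEQ ID NO: 1.
    """
    first_nt = 3 * (residue - 1) + 1      # 1-based position of the codon's first base
    offset = first_nt - TAIL_START        # 0-based index into TAIL
    if offset < 0 or offset + 3 > len(TAIL):
        raise ValueError("residue lies outside the copied tail")
    return TAIL[offset:offset + 3]


codon = codon_at(380)
print(codon, codon in STOP_CODONS)
```

Under these assumptions, residue 380 maps to nucleotides 1138 to 1140, which is where the listing places "Xaa".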

<210> 3 

<211> 1149 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic 
<220> 

<221> sig_peptide 

<222> (1)..(24) 
<223> 

<400> 3 



cggatccaag gagatataac aatggacaac atcctcagtg gcttcgagaa cgagtacgac  60
gtaatctacc taaagcccct tgccggtgtc taccgttcat tgaagaaaca gatagaaaag  120
aatattttca cgttcaacct caacctaaat gacatcctca actcgcgcct caagaagcga  180
aaatacttcc tcgacgtgtt ggaatccgac cttatgcaat tcaagcacat tagctctaac  240
gagtacatca tagaggacag cttcaagctc ttgaattcag aacagaagaa caccctccta  300
aagtcctaca aatacattaa ggagtctgtt gagaacgaca tcaagttcgc ccaggaagga  360
attagctact atgagaaagt cctggctaaa tacaaggacg acttggaaag cattaagaag  420
gtaatcaaag aagagaagga aaagtttccg agctctccac ccacaactcc cccatcgcct  480
gcaaagaccg acgagcagaa aaaagaaagt aagttccttc cattcctcac caacatcgaa  540
actctatata acaacctggt gaacaagatt gatgactact taatcaactt gaaggcgaaa  600
attaatgact gtaacgtcga aaaggatgaa gcccacgtta agatcaccaa gctttccgat  660
ctcaaagcca tcgacgataa gattgacctg tttaagaacc acaacgattt cgacgcaatc  720
aaaaagttga tcaacgacga tactaagaaa gacatgcttg gaaaactgct gtcgacaggc  780
ttggtccaaa acttcccgaa caccattata agcaagctga tcgaaggaaa gtttcaggat  840
atgctgaaca tctctcagca tcaatgcgtg aagaagcaat gtcccgagaa ttcaggttgc  900
ttccgccact tagacgaaag ggaggaatgt aaatgcctgc tgaattataa acaggaagga  960
gacaagtgcg tagagaatcc taacccaacc tgtaacgaaa ataacggtgg ctgcgatgct  1020
gacgctaagt gtaccgagga ggacagcggt tccaatggca agaaaataac ttgcgaatgc  1080
acgaagcccg atagttaccc tctcttcgac ggtatcttct gctcccatga tgagctttaa  1140
gagctcacc  1149



<210> 4
<211> 383
<212> PRT
<213> Artificial sequence
<220>
<223> synthetic
<220>
<221> MISC_FEATURE
<222> (380)..(380)
<223> "Xaa" at position 380 represents a stop codon

<400> 4


Arg Ile Gln Gly Asp Ile Thr Met Asp Asn Ile Leu Ser Gly Phe Glu
1 5 10 15



Asn Glu Tyr Asp Val Ile Tyr Leu Lys Pro Leu Ala Gly Val Tyr Arg
20 25 30

Ser Leu Lys Lys Gln Ile Glu Lys Asn Ile Phe Thr Phe Asn Leu Asn
35 40 45

Leu Asn Asp Ile Leu Asn Ser Arg Leu Lys Lys Arg Lys Tyr Phe Leu
50 55 60

Asp Val Leu Glu Ser Asp Leu Met Gln Phe Lys His Ile Ser Ser Asn
65 70 75 80

Glu Tyr Ile Ile Glu Asp Ser Phe Lys Leu Leu Asn Ser Glu Gln Lys
85 90 95

Asn Thr Leu Leu Lys Ser Tyr Lys Tyr Ile Lys Glu Ser Val Glu Asn
100 105 110

Asp Ile Lys Phe Ala Gln Glu Gly Ile Ser Tyr Tyr Glu Lys Val Leu
115 120 125

Ala Lys Tyr Lys Asp Asp Leu Glu Ser Ile Lys Lys Val Ile Lys Glu
130 135 140

Glu Lys Glu Lys Phe Pro Ser Ser Pro Pro Thr Thr Pro Pro Ser Pro
145 150 155 160

Ala Lys Thr Asp Glu Gln Lys Lys Glu Ser Lys Phe Leu Pro Phe Leu
165 170 175

Thr Asn Ile Glu Thr Leu Tyr Asn Asn Leu Val Asn Lys Ile Asp Asp
180 185 190

Tyr Leu Ile Asn Leu Lys Ala Lys Ile Asn Asp Cys Asn Val Glu Lys
195 200 205

Asp Glu Ala His Val Lys Ile Thr Lys Leu Ser Asp Leu Lys Ala Ile
210 215 220

Asp Asp Lys Ile Asp Leu Phe Lys Asn His Asn Asp Phe Asp Ala Ile
225 230 235 240

Lys Lys Leu Ile Asn Asp Asp Thr Lys Lys Asp Met Leu Gly Lys Leu
245 250 255

Leu Ser Thr Gly Leu Val Gln Asn Phe Pro Asn Thr Ile Ile Ser Lys
260 265 270

Leu Ile Glu Gly Lys Phe Gln Asp Met Leu Asn Ile Ser Gln His Gln
275 280 285

Cys Val Lys Lys Gln Cys Pro Glu Asn Ser Gly Cys Phe Arg His Leu
290 295 300

Asp Glu Arg Glu Glu Cys Lys Cys Leu Leu Asn Tyr Lys Gln Glu Gly
305 310 315 320

Asp Lys Cys Val Glu Asn Pro Asn Pro Thr Cys Asn Glu Asn Asn Gly
325 330 335

Gly Cys Asp Ala Asp Ala Lys Cys Thr Glu Glu Asp Ser Gly Ser Asn
340 345 350

Gly Lys Lys Ile Thr Cys Glu Cys Thr Lys Pro Asp Ser Tyr Pro Leu
355 360 365

Phe Asp Gly Ile Phe Cys Ser His Asp Glu Leu Xaa Glu Leu Thr
370 375 380



<210> 5 

<211> 38 

<212> DNA 

<213> Artificial sequence 






<220> 

<223> synthetic 



<400> 5 

cgggatccca aggagatacc atggcagtaa ctccttcc  38



<210> 6 

<211> 42 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic 

<400> 6 

ctgcgagctc ttattaaagc tcatcatgct gcagaaaata cc  42



<210> 7
<211> 28
<212> DNA
<213> Artificial sequence
<220>
<223> synthetic
<400> 7
ctgcgagctc ttattaatga tgatgatg  28

<210> 8
<211> 42
<212> DNA
<213> Artificial sequence
<220>
<223> synthetic
<400> 8
ggtgagctct taaagctcat catgggagca gaagataccg tc  42

<210> 9
<211> 24
<212> DNA
<213> Artificial sequence
<220>
<223> synthetic
<400> 9
gccgaattcg acaacatcct cagt  24



<210> 10
<211> 42
<212> DNA
<213> Artificial sequence
<220>
<223> synthetic
<400> 10
ggtgagctct taaagctcat catgggagca gaagataccg tc  42

<210> 11 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic 

<400> 11 

agagctaatg tgcttgaatt gcataaggtc  30

<210> 12 

<211> 30 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic 

<400> 12 

gaccttatgc aattcaagca cattagctct  30
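
SEQ ID NO: 11 and SEQ ID NO: 12 read as reverse complements of one another, which would be consistent with a complementary oligonucleotide pair, although the listing itself does not state their purpose. The following is a minimal sketch for confirming the relationship. It is not part of the filed listing, and the function name `reverse_complement` is an illustrative choice.

```python
# SEQ ID NO: 11 and 12, copied from the listing with grouping spaces removed.
SEQ_11 = "agagctaatgtgcttgaattgcataaggtc"
SEQ_12 = "gaccttatgcaattcaagcacattagctct"

# Watson-Crick pairing for lowercase unambiguous bases only.
COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Complement each base, then reverse the strand."""
    return seq.translate(COMPLEMENT)[::-1]


print(reverse_complement(SEQ_12) == SEQ_11)
```

The translation table covers only a, c, g and t, so a sequence containing ambiguity codes (such as the "s" in SEQ ID NO: 18) would need an extended table.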



<210> 13 

<211> 1235 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic 

<400> 13 



attggatcca ctaaaatgtg gtcttggaag tgtcttttat tctgggctgt cttggtgacc  60
gccactcttt gcacagcagc gatctctgtt actatggaca acatcctcag tggcttcgag  120
aacgagtacg acgtaatcta cctaaagccc cttgccggtg tctaccgttc attgaagaaa  180
cagatagaaa agaatatttt cacgttcaac ctcaacctaa atgacatcct caactcgcgc  240
ctcaagaagc gaaaatactt cctcgacgtg ttggaatccg accttatgca atttaagcac  300
attagctcta acgagtacat catagaggac agcttcaagc tcttgaattc agaacagaag  360
aacaccctcc taaagtccta caaatacatt aaggagtctg ttgagaacga catcaagttc  420
gcccaggaag gaattagcta ctatgagaaa gtcctggcta aatacaagga cgacttggaa  480
agcattaaga aggtaatcaa agaagagaag gaaaagtttc cgagctctcc acccacaact  540
cccccatcgc ctgcaaagac cgacgagcag aaaaaagaaa gtaagttcct tccattcctc  600
accaacatcg aaactctata taacaacctg gtgaacaaga ttgatgacta cttaatcaac  660
ttgaaggcga aaattaatga ctgtaacgtc gaaaaggatg aagcccacgt taagatcacc  720
aagctttccg atctcaaagc catcgacgat aagattgacc tgtttaagaa ccacaacgat  780
ttcgacgcaa tcaaaaagtt gatcaacgac gatactaaga aagacatgct tggaaaactg  840
ctgtcgacag gcttggtcca aaacttcccg aacaccatta taagcaagct gatcgaagga  900
aagtttcagg atatgctgaa catctctcag catcaatgcg tgaagaagca atgtcccgag  960
aattcaggtt gcttccgcca cttagacgaa agggaggaat gtaaatgcct gctgaattat  1020
aaacaggaag gagacaagtg cgtagagaat cctaacccaa cctgtaacga aaataacggt  1080
ggctgcgatg ctgacgctaa gtgtaccgag gaggacagcg gttccaatgg caagaaaata  1140
acttgcgaat gcacgaagcc cgatagttac cctctcttcg acggtatctt ctgctcccca  1200
cctcatcatc atcatcatca ttaataaggt accta  1235



<210> 14 

<211> 402
<212> PRT 

<213> Artificial sequence 
<220> 

<223> synthetic 
<400> 14 

Met Trp Ser Trp Lys Cys Leu Leu Phe Trp Ala Val Leu Val Thr Ala
1 5 10 15

Thr Leu Cys Thr Ala Ala Ile Ser Val Thr Met Asp Asn Ile Leu Ser
20 25 30

Gly Phe Glu Asn Glu Tyr Asp Val Ile Tyr Leu Lys Pro Leu Ala Gly
35 40 45

Val Tyr Arg Ser Leu Lys Lys Gln Ile Glu Lys Asn Ile Phe Thr Phe
50 55 60

Asn Leu Asn Leu Asn Asp Ile Leu Asn Ser Arg Leu Lys Lys Arg Lys
65 70 75 80

Tyr Phe Leu Asp Val Leu Glu Ser Asp Leu Met Gln Phe Lys His Ile
85 90 95

Ser Ser Asn Glu Tyr Ile Ile Glu Asp Ser Phe Lys Leu Leu Asn Ser
100 105 110

Glu Gln Lys Asn Thr Leu Leu Lys Ser Tyr Lys Tyr Ile Lys Glu Ser
115 120 125

Val Glu Asn Asp Ile Lys Phe Ala Gln Glu Gly Ile Ser Tyr Tyr Glu
130 135 140

Lys Val Leu Ala Lys Tyr Lys Asp Asp Leu Glu Ser Ile Lys Lys Val
145 150 155 160

Ile Lys Glu Glu Lys Glu Lys Phe Pro Ser Ser Pro Pro Thr Thr Pro
165 170 175

Pro Ser Pro Ala Lys Thr Asp Glu Gln Lys Lys Glu Ser Lys Phe Leu
180 185 190

Pro Phe Leu Thr Asn Ile Glu Thr Leu Tyr Asn Asn Leu Val Asn Lys
195 200 205

Ile Asp Asp Tyr Leu Ile Asn Leu Lys Ala Lys Ile Asn Asp Cys Asn
210 215 220

Val Glu Lys Asp Glu Ala His Val Lys Ile Thr Lys Leu Ser Asp Leu
225 230 235 240

Lys Ala Ile Asp Asp Lys Ile Asp Leu Phe Lys Asn His Asn Asp Phe
245 250 255

Asp Ala Ile Lys Lys Leu Ile Asn Asp Asp Thr Lys Lys Asp Met Leu
260 265 270

Gly Lys Leu Leu Ser Thr Gly Leu Val Gln Asn Phe Pro Asn Thr Ile
275 280 285

Ile Ser Lys Leu Ile Glu Gly Lys Phe Gln Asp Met Leu Asn Ile Ser
290 295 300

Gln His Gln Cys Val Lys Lys Gln Cys Pro Glu Asn Ser Gly Cys Phe
305 310 315 320

Arg His Leu Asp Glu Arg Glu Glu Cys Lys Cys Leu Leu Asn Tyr Lys
325 330 335

Gln Glu Gly Asp Lys Cys Val Glu Asn Pro Asn Pro Thr Cys Asn Glu
340 345 350

Asn Asn Gly Gly Cys Asp Ala Asp Ala Lys Cys Thr Glu Glu Asp Ser
355 360 365

Gly Ser Asn Gly Lys Lys Ile Thr Cys Glu Cys Thr Lys Pro Asp Ser
370 375 380

Tyr Pro Leu Phe Asp Gly Ile Phe Cys Ser Pro Pro His His His His
385 390 395 400

His His



<210> 15 
<211> 150 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic 
<400> 15 

aaaatgatat taaatttgca caggaaggta taagttatta tgaaaaggtt aacattgaga  60
ccttatacaa taacttagtt aataaaattg acgattactt tccagaaaat tctggatgtt  120
tcagacattt agatgaaaga gaagaatgta  150



<210> 16 
<211> 142 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> synthetic 
<400> 16 

aatttcaaaa tgttttagaa tcagatttaa ttccatataa agatttaaca tcaagtaatt  60
atgttgtcaa agatccatat aaatttctta ataaagaaaa aagagataaa ttcttaagca  120
gttataatta tattaaggat tc  142



<210> 17 

<211> 5 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> linker consensus sequence

<400> 17 

Gly Ser Gly Gly Ser 
1 5 



<210> 18 






<211> 4 

<212> DNA 

<213> Artificial sequence 
<220> 

<223> linker consensus sequence 

<400> 18 
gggs 

<210> 19 

<211> 4 

<212> PRT 

<213> Artificial sequence 
<220> 

<223> endoplasmic reticulum retention signal 

<400> 19 

His Asp Glu Leu 
1 






